SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: COSGROVE, DANIEL J.;
GUILTINAN, MARK;
SHCHERBAN, TATYANA;
SHI, JUN

(ii) TITLE OF INVENTION: PURIFIED EXPANSIN PROTEINS 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) INTELLECTUAL PROPERTY OFFICE, THE PENNSYLVANIA STATE UNIVERSITY

(B) STREET: 113 TECHNOLOGY CENTER

(C) CITY: UNIVERSITY PARK 

(D) STATE: PENNSYLVANIA 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 16802-7000 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: FLOPPY DISK 

(B) COMPUTER: NEC 286 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: WORDPERFECT 5.1 

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: UNKNOWN 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAC TAC GGT GGC TGG CAG AGC GGC CAC GCC ACC TTT TAT GGT 42
Asp Tyr Gly Gly Trp Gln Ser Gly His Ala Thr Phe Tyr Gly
1 5 10

GGT GGT GAC GCA TCT GGC ACC ATG GGT GGA GCT TGT GGG TAT 84
Gly Gly Asp Ala Ser Gly Thr Met Gly Gly Ala Cys Gly Tyr
15 20 25

GGG AAT TTA TAC AGC CAA GGG TAT GGC ACG AAC ACG GTG GCG 126
Gly Asn Leu Tyr Ser Gln Gly Tyr Gly Thr Asn Thr Val Ala
30 35 40

CTG AGC ACT GCG CTA TTT AAC AAT GGA TTA AGT TGT GGT GCT 168
Leu Ser Thr Ala Leu Phe Asn Asn Gly Leu Ser Cys Gly Ala
45 50 55

TGC TTC GAA ATG ACT TGT ACA AAC GAC CCT AAA TGG TGC CTT 210
Cys Phe Glu Met Thr Cys Thr Asn Asp Pro Lys Trp Cys Leu
60 65 70

CCG GGA ACT ATT AGG GTC ACT GCC ACC AAC TTT TGC CCT CCT 252
Pro Gly Thr Ile Arg Val Thr Ala Thr Asn Phe Cys Pro Pro
75 80

AAC TTT GCT CTC CCT AAC AAC AAT GGT GGA TGG TGC AAC CCT 294
Asn Phe Ala Leu Pro Asn Asp Asp Gly Gly Trp Cys Asn Pro
85 90 95

CCT CTC CAA CAC TTC GAC ATG GCT GAG CCT GCC TTC CTT CAA 336
Pro Leu Gln His Phe Asp Met Ala Glu Pro Ala Phe Leu Gln
100 105 110

ATC GCT CAA TAC CGA GCT GGT ATC GTC CCC GTC TCC TTT CGT 378
Ile Ala Gln Tyr Arg Ala Gly Ile Val Pro Val Ser Phe Arg
115 120 125

AGG GTA CCA TGT ATG AAG AAA GGT GGA GTG AGG TTT ACA ATC 420
Arg Val Pro Cys Met Lys Lys Gly Gly Val Arg Phe Thr Ile
130 135 140

AAT GGC CAC TCA TAC TTC AAC CTC GTT TTG ATC ACA AAC GTC 462
Asn Gly His Ser Tyr Phe Asn Leu Val Leu Ile Thr Asn Val
145 150

GGT GGC GCA GGC GAC GTC CAC TCT GTG TCG ATA AAG GGG TCT 504
Gly Gly Ala Gly Asp Val His Ser Val Ser Ile Lys Gly Ser
155 160 165

CGA ACT GGA TGG CAA TCC ATG TCT AGA AAT TGG GGC CAA AAC 546
Arg Thr Gly Trp Gln Ser Met Ser Arg Asn Trp Gly Gln Asn
170 175 180

TGG CAA AGC AAC AAC TAT CTC AAT GGC CAA GGC CTT TCC TTT 588
Trp Gln Ser Asn Asn Tyr Leu Asn Gly Gln Gly Leu Ser Phe
185 190 195

CAA GTC ACT CTT AGT GAT GGT CGC ACT CTC ACT GCC TAT AAT 630
Gln Val Thr Leu Ser Asp Gly Arg Thr Leu Thr Ala Tyr Asn
200 205 210

CTC GTT CCT TCC AAT TGG CAA TTT GGC CAA ACC TAT GAA GGC 672
Leu Val Pro Ser Asn Trp Gln Phe Gly Gln Thr Tyr Glu Gly
215 220

CCT CAA TTC 681
Pro Gln Phe
225
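
The paired codon and residue rows of SEQ ID NO: 1 can be cross-checked mechanically. The following is a minimal Python sketch, not part of the filed listing, that assumes the standard genetic code applies; the names `translate` and `mismatches` are hypothetical helpers introduced only for illustration. It translates one codon row into three-letter residue codes and reports any position where the listed residue differs from the translation.

```python
# Illustrative sketch only: assumes the standard genetic code.
# Codons are enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_1 = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",  # stop codon
}
CODON_TABLE = {
    a + b + c: ONE_TO_THREE[AMINO_1[16 * i + 4 * j + k]]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codon_row):
    """Translate a whitespace-separated row of DNA codons to three-letter codes."""
    return [CODON_TABLE[codon] for codon in codon_row.upper().split()]


def mismatches(codon_row, residue_row):
    """Return (position in row, codon, translated, listed) for each disagreement.

    Rows of unequal length are compared only up to the shorter one.
    """
    codons = codon_row.upper().split()
    listed = residue_row.split()
    return [
        (pos, codon, expected, got)
        for pos, (codon, expected, got) in enumerate(
            zip(codons, translate(codon_row), listed), start=1
        )
        if expected != got
    ]


# First row of SEQ ID NO: 1 as listed above.
row = "GAC TAC GGT GGC TGG CAG AGC GGC CAC GCC ACC TTT TAT GGT"
residues = "Asp Tyr Gly Gly Trp Gln Ser Gly His Ala Thr Phe Tyr Gly"
print(mismatches(row, residues))  # prints []
```

Running the same check over every row is one way to flag transcription discrepancies between the nucleotide and residue lines; the trailing base count on each codon row (42, 84, ...) would need to be stripped before the row is passed in.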

(3) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: UNKNOWN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala Gly Gly Gly Trp Val Asn Ala His Ala Thr Phe Tyr Gly Gly
1 5 10 15

Gly Asp Ala Ser Gly Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn
20 25 30

Leu Tyr Ser Gln Gly Tyr Gly Thr Asn Thr Ala Ala Leu Ser Thr
35 40 45

Ala Leu Phe Asn Asn Gly Leu Ser Cys Gly Ala Cys Phe Glu Ile
50 55 60

Arg Cys Gln Asn Asp Gly Lys Trp Cys Leu Pro Gly Ser Ile Val
65 70 75

Val Thr Ala Thr Asn Phe Cys Pro Pro Asn Asn Ala Leu Pro Asn
80 85 90

Asn Ala Gly Gly Trp Cys Asn Pro Pro Gln Gln His Phe Asp Leu
95 100 105

Ser Gln Pro Val Phe Gln Arg Ile Ala Gln Tyr Arg Ala Gly Ile
110 115 120

Val Pro Val Ala Tyr Arg Arg Val Pro Cys Val Arg Arg Gly Gly
125 130 135

Ile Arg Phe Thr Ile Asn Gly His Ser Tyr Phe Asn Leu Val Leu
140 145 150

Ile Thr Asn Val Gly Gly Ala Gly Asp Val His Ser Ala Met Val
155 160 165

Lys Gly Ser Arg Thr Gly Trp Gln Ala Met Ser Arg Asn Trp Gly
170 175 180

Gln Asn Trp Gln Ser Asn Ser Tyr Leu Asn Gly Gln Ser Leu Ser
185 190 195

Phe Lys Val Thr Thr Ser Asp Gly Gln Thr Ile Val Ser Asn Asn
200 205 210

Xaa Ala Asn Ala Gly Trp Ser Phe Gly Gln Thr Phe Thr Gly Ala
215 220 225

His Val Arg

(4) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: UNKNOWN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

His Met Gly Pro Trp Ile Asn Ala His Ala Thr Phe Tyr Xaa Xaa
1 5 10 15






Gly Asp Ala Xaa Xaa Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn
20 25 30

Leu Tyr Ser Gln Gly Tyr Gly Leu Glu Thr Ala Ala Leu Ser Thr
35 40 45

Ala Leu Phe Asp Gln Gly Leu Ser Cys Gly Ala Cys Xaa Glu Leu
50 55 60

Met Cys Val Asn Asp Pro Gln Trp Cys Ile Lys Gly Arg Ser Ile
65 70 75

Val Val Thr Ala Thr Asn Phe Cys Pro Pro Gly Gly Ala Cys Asp
80 85 90

Pro Pro Asn His His Phe Asp Leu Ser Gln Pro Ile Tyr Glu Lys
95 100 105

Ile Ala Leu Tyr Lys Ser Gly Ile Ile Pro Val Met Tyr Arg Arg
110 115 120

Val Arg Cys Lys Arg Ser Gly Gly Ile Arg Phe Thr Ile Asn Gly
125 130 135

His Ser Tyr Phe Asn Leu Val Leu Val Thr Asn Val Gly Gly Ala
140 145 150

Gly Asp Val His Ser Val Ser Met Lys Gly Ser Arg Thr Lys Trp
155 160 165

Gln Leu Met Ser Arg Asn Trp Gly Gln Asn Trp Gln Ser Asn Ser
170 175 180

Tyr Leu Asn Gly Gln Ser Leu Ser Phe Val Val Thr Thr Ser Asp
185 190 195

Arg Arg Ser Val Val Ser Phe Asn Val Ala Pro Pro Thr Trp Ser
200 205 210

Phe Gly Gln Thr Tyr Thr Gly Gly Gln Phe Arg Tyr
215 220



(5) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 227

(B) TYPE: AMINO ACID
(D) TOPOLOGY: UNKNOWN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Xaa Ser Val Ala Gln Ser Ala Phe Ala Thr Phe Tyr Gly Gly
1 5 10 15

Lys Asp Gly Ser Cys Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn
20 25 30

Leu Tyr Asn Ala Gly Tyr Gly Leu Tyr Asn Ala Ala Leu Ser Ser
35 40 45

Ala Leu Phe Asn Asp Gly Ala Met Cys Gly Ala Cys Tyr Thr Ile
50 55 60

Thr Cys Asp Thr Ser Gln Thr Lys Trp Cys Lys Pro Gly Gly Asn
65 70 75

Ser Ile Thr Ile Thr Ala Thr Asn Leu Cys Xaa Pro Asn Trp Ala
80 85 90

Leu Pro Ser Asn Ser Gly Gly Trp Cys Asn Pro Pro Leu Xaa His
95 100 105

Phe Asp Met Ser Gln Pro Ala Trp Glu Asn Ile Ala Val Tyr Gln
110 115 120

Ala Gly Ile Val Pro Val Asn Tyr Lys Arg Val Pro Xaa Gln Arg
125 130 135

Ser Gly Gly Ile Arg Phe Ala Ile Ser Gly His Asp Tyr Phe Glu
140 145 150

Leu Val Thr Val Thr Asn Val Gly Gly Ser Gly Val Val Ala Gln
155 160 165

Met Ser Ile Lys Gly Ser Asn Thr Gly Trp Met Ala Met Ser Arg
170 175 180

Asn Trp Gly Ala Asn Trp Gln Ser Asn Ala Tyr Leu Ala Gly Gln
185 190 195

Ser Leu Ser Phe Ile Val Gln Leu Asp Asp Gly Arg Lys Val Thr
200 205 210

Ala Trp Asn Xaa Ala Pro Xaa Asn Trp Leu Xaa Xaa Xaa Xaa Xaa
215 220 225

Xaa Xaa

(6) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: UNKNOWN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Asn Gly Gly Trp Glu Arg Gly His Ala Thr Phe Tyr Gly Gly
1 5 10 15

Ala Asp Ala Ser Gly Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn
20 25 30

Leu His Ser Gln Gly Tyr Gly Leu Gln Thr Ala Ala Leu Ser Thr
35 40 45

Ala Leu Phe Asn Ser Gly Gln Lys Cys Gly Ala Cys Phe Glu Leu
50 55 60

Thr Cys Glu Asp Asp Pro Glu Trp Cys Ile Pro Gly Ser Ile Ile
65 70 75

Val Arg Tyr Asn Leu Ala Asn Phe Ala Leu Ala Asn Asp Asn Gly
80 85 90

Gly Trp Cys Asn Pro Pro Leu Lys His Phe Asp Leu Ala Glu Pro
95 100 105

Ala Phe Leu Gln Ile Ala Gln Tyr Arg Ala Gly Ile Val Pro Val
110 115 120

Ala Phe Arg Arg Val Pro Cys Glu Lys Gly Gly Gly Ile Arg Phe
125 130 135

Thr Ile Asn Gly Asn Pro Tyr Phe Asp Leu Val Leu Ile Thr Asn
140 145 150

Val Gly Gly Ala Gly Asp Ile Arg Ala Val Ser Leu Lys Gly Ser
155 160 165

Lys Thr Asp Gln Trp Gln Ser Met Ser Arg Asn Trp Gly Gln Asn
170 175 180

Trp Gln Ser Asn Thr Tyr Leu Arg Gly Gln Ser Leu Ser Phe Gln
185 190 195

Val Thr Asp Ser Asp Gly Arg Thr Val Val Ser Tyr Asp Val Val
200 205 210

Pro His Asp Trp Gln Phe Gly Gln Thr Phe Glu Gly Gly Gln Phe
215 220 225



(7) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: UNKNOWN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Tyr Ser Ser Trp Gln Ser Ala His Ala Thr Phe Tyr Gly Gly
1 5 10 15

Gly Asp Ala Ser Gly Thr Met Gly Gly Thr Cys Gly Tyr Gly Asn
20 25 30

Leu Tyr Ser Thr Gly Tyr Thr Asn Thr Ala Ala Leu Ser Thr Val
35 40 45

Leu Phe Asn Asp Gly Ala Ala Cys Arg Ser Cys Tyr Glu Leu Arg
50 55 60

Cys Asp Asn Asp Gly Gln Trp Cys Leu Pro Gly Ser Val Thr Val
65 70 75

Thr Ala Thr Asn Leu Cys Pro Pro Asn Tyr Ala Leu Pro Asn Asp
80 85 90

Asp Gly Gly Trp Cys Asn Pro Pro Arg Pro His Phe Asp Met Ala
95 100 105

Glu Pro Ala Phe Leu Gln Ile Gly Val Tyr Arg Ala Gly Ile Val
110 115 120

Pro Val Ser Tyr Arg Arg Val Pro Cys Val Lys Lys Gly Gly Ile
125 130 135

Arg Phe Thr Ile Asn Gly His Ser Tyr Phe Asn Leu Val Leu Val
140 145 150

Thr Asn Val Ala Gly Pro Gly Asp Val Gln Ser Val Ser Ile Lys
155 160 165

Gly Ser Ser Thr Gly Trp Gln Pro Met Ser Arg Asn Trp Gly Gln
170 175 180

Asn Trp Gln Ser Asn Ser Tyr Leu Asp Gly Gln Ser Leu Ser Phe
185 190 195

Gln Val Ala Val Ser Asp Gly Arg Thr Val Thr Ser Asn Asn Val
200 205 210

Val Pro Ala Gly Trp Gln Phe Gly Gln Thr Phe Glu Gly Gly Gln
215 220 225

Phe



